首页> 外文OA文献 >Confounder Adjustment in Multiple Hypothesis Testing
【2h】

Confounder Adjustment in Multiple Hypothesis Testing

机译:多假设检验中的混淆调整

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We consider large-scale studies in which thousands of significance tests areperformed simultaneously. In some of these studies, the multiple testingprocedure can be severely biased by latent confounding factors such as batcheffects and unmeasured covariates that correlate with both primary variable(s)of interest (e.g. treatment variable, phenotype) and the outcome. Over the pastdecade, many statistical methods have been proposed to adjust for theconfounders in hypothesis testing. We unify these methods in the sameframework, generalize them to include multiple primary variables and multiplenuisance variables, and analyze their statistical properties. In particular, weprovide theoretical guarantees for RUV-4 and LEAPP, which correspond to twodifferent identification conditions in the framework: the first requires a setof "negative controls" that are known a priori to follow the null distribution;the second requires the true non-nulls to be sparse. Two different estimatorswhich are based on RUV-4 and LEAPP are then applied to these two scenarios. Weshow that if the confounding factors are strong, the resulting estimators canbe asymptotically as powerful as the oracle estimator which observes the latentconfounding factors. For hypothesis testing, we show the asymptotic z-testsbased on the estimators can control the type I error. Numerical experimentsshow that the false discovery rate is also controlled by the Benjamini-Hochbergprocedure when the sample size is reasonably large.
机译:我们考虑大规模研究,其中数千项显着性检验同时进行。在这些研究中的某些研究中,多重测试过程可能会因潜在的混杂因素(例如批处理效应和与目标主要变量(例如治疗变量,表型)相关的未测量的协变量)严重偏差。在过去的十年中,已经提出了许多统计方法来调整假设检验中的混杂因素。我们将这些方法统一在同一框架中,将它们概括为包括多个主要变量和多个有害变量,并分析其统计属性。特别是,我们为RUV-4和LEAPP提供了理论上的保证,它们对应于框架中的两个不同的识别条件:第一个需要先验地遵循零分布的一组“负控制”;第二个需要真正的非-控制。 null为稀疏。然后,将基于RUV-4和LEAPP的两种不同的估计器应用于这两种情况。我们表明,如果混杂因素很强,则所得估计量可能与观察潜在混杂因素的甲骨文估计量渐近地强大。对于假设检验,我们展示了基于估计量的渐近z检验可以控制I型误差。数值实验表明,当样本量较大时,错误识别率也受Benjamini-Hochberg过程控​​制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号